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This work studies the following question: can plays in a Muller game be stopped after a finite number 
of moves and a winner be declared. A criterion to do this is sound if Player wins an infinite-duration 
Muller game if and only if she wins the finite-duration version. A sound criterion is presented that 
stops a play after at most 3" moves, where n is the size of the arena. This improves the bound 
(n! + 1 )" obtained by McNaughton and the bound «! + 1 derived from a reduction to parity games. 

1 Introduction 

In an infinite game, two players move a token through a finite graph thereby building an infinite path. 
The winner is determined by a partition of the infinite paths tlirough the arena into the paths that are 
winning for Player or winning for Player 1, respectively. Many winning conditions in the literature 
depend on the vertices that are visited infinitely often, i.e., the winner of a play cannot be determined 
after any finite number of steps. We are interested in the following question: is it nevertheless possible to 
give a criterion to define a finite-duration variant of an infinite game. Such a criterion has to stop a play 
after a finite number of steps and then declare a winner based on the finite play constructed thus far. It is 
sound if Player has a winning strategy for the infinite-duration game if and only if she has a winning 
strategy for the finite-duration game. 

McNaughton considered the problem of playing infinite games in finite time from a different per- 
spective. His motivation was to make infinite games suitable for "casual living room recreation" [8|. As 
human players cannot play infinitely long, he envisions a referee that stops a play at a certain time and 
declares a winner The justification for declaring a winner is that "if the play were to continue with each 
[player] playing forever as he has so far, then the player declared to be the winner would be the winner 
of the infinite play of the game" IH. 

Besides this recreational aspect of infinite games there are several interesting theoretical questions 
that motivate investigating this problem. If there exists a sound criterion to stop a play after at most n 
steps, this yields a simple algorithm to determine the winner of the infinite game: the finite-duration 
game can be seen as a reachability game on a finite tree of depth at most n that is won by the same 
player that wins the infinite-duration game. There exist simple and efficient algorithms to determine the 
winner in reachability games on trees. Furthermore, a positive answer to the question whether a winning 
strategy for the reachability game can be turned into a (small finite-state) winning strategy should yield 
better results in the average (although not in the worst case) than game reductions, which ignore the 
structure of the arena. 

Consider the following criterion: the players move the token through the arena until a vertex is visited 
for the second time. An infinite play can then be obtained by assuming that the players continue to play 
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the cycle that they have constructed. Then, the winner of the infinite play is declared to be the winner of 
the finite play. If the game is determined with positional strategies for both players, then this procedure 
is correct: if a player has a winning strategy for the infinite game, which can be assumed to be positional, 
then she can use the same strategy to win the finite version of the game and vice versa. 

Therefore, McNaughton proposes that we should consider games that are in general not positionally 
determined. Here, the first loop of a play is typically not an indicator of how the infinite play evolves, 
as the memory allows a player to make different decisions when a vertex is seen again. Therefore, the 
players have to play longer before the play can be stopped and analyzed. 

McNaughton considers MuUer games, which are games of the form {G,^o,^i), where G is a finite 
arena and (^o>=^i) is a partition of the set of vertices. Player / wins a play, if the set of vertices visited 
infinitely often by this play is in Muller winning conditions allow us to express all other winning 
conditions that depend only in the infinity set of a play (e.g., Biichi, co-Buchi, parity, Rabin, and Streett 
conditions). 

To give a sound criterion for Muller games, McNaughton defines for every set of vertices F a scoring 
function Sc^ that keeps track of the number of times the set F was visited entirely since the last visit of a 
vertex that is not in F. In an infinite play, the set of vertices seen infinitely often is the unique set F such 
that Scf will tend to infinity with being reset to only finitely often. 



Let G be the arena in Figure [T] (Player O's vertices are shown as circles and Player I's vertices are 
shown as squares) and the Muller game = {G,.^q,^i) with = {{0, 1,2}, {0}, {2}}. In the play 
100122121 we have that the score for the set {1,2} is 3, as it was seen thrice (i.e., with the infixes 12, 
21, and 21). Note that the order of the visits to the elements of F is irrelevant and that it is not required 
to close a loop in the arena. The following winning strategy for Player bounds the scores of Player 1 
by 2: arriving from at 1 move to 2 and vice versa. However, Player cannot avoid a score of 2 for her 
opponent, as either the play prefix 1001 or 1221 is consistent with every winning strategy. 

By using finite-state determinacy of Muller games, McNaughton suggests that the criterion should 
stop a play after a score of |f | ! + 1 for some set F is reached. He shows that picking the winner to be the 
Player / such that F G is indeed sound. 

Applying finite-state determinacy one can also show that one can soundly declare a winner after at 
most |G[! + 1 steps, as a repetition of a memory state has occurred after that many steps. Note that for 
large sets F, it could take far more than |G| ! + 1 steps to reach a score of |F| ! + 1, as scores can increase 
slowly or can even be reset to 0. However, to decide whether a memory state repetition has occurred, 
it might be necessary to compute the complete memory structure for the given game, which is of size 
|G| !. Keeping track of scores is much simpler, as they can be computed on the fly while the play is being 
played. Also, there are at most [G| sets F with non-zero score. 

Our contribution. We show that declaring the winner of a play as soon as the score 3 is reached for the 
first time is a sound criterion. We complement this by proving that a score of 3 is reached after at most 
31*^1 steps. Hence, we obtain a better bound than [G| ! + 1, which was derived from waiting for repetitions 
of memory states. 




Figure 1 : The arena G. 
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Our results are obtained by using Zielonka's algorithm |[T0]| (a reinterpretation of an earlier algorithm 
by McNaughton [7]) for computing winning regions in Muller games. We carefully define a winning 
strategy that bounds the scores of the opponent by 2. In the example above, the winning player cannot 
avoid a score of 2 for the opponent. Hence, in this sense our result is optimal. 

Related work. Usually, the quality of a strategy is measured in terms of memory needed to implement 
it. However, there are other natural quality measures of winning strategies. In [2], the authors study 
a strengthening of parity (and Streett) objectives, which require that there is some bound between the 
occurrences of even colors. Another application of this concept appears in work on request response 
games HKTTl, where waiting times between requests and subsequent responses are used to define the 
value of a play. There it is shown that time-optimal winning strategies can be computed effectively. 

The maximal score achieved by the opponent in a play can be used to measure the quality of winning 
plays in a Muller game. Player prefers plays in which the scores of her opponent are small. This 
corresponds to not spending a prolonged amount of time in a set of the opponent, but visiting every 
vertex that is seen infinitely often without large gaps. 

This paper is structured as follows. Section |2] contains basic definitions and fixes our notation. In 
Section [3l we introduce the scoring functions, prove some properties about scoring and define finite-time 
Muller games. In Section |4l we present Zielonka's algorithm which is used in Section |5] to prove the 
main result. Section[6]ends the paper with a conclusion and some pointers to further research. 

2 Definitions 

The power set of a set S is denoted by 2^ and N denotes the non-negative integers. The prefix relation on 
words is denoted by C, its strict version by C. Given a word w = xy, define x^^w = y and wy^^ = x. 

An arena G = {V,Vq,V\,E) consists of a finite, directed graph {V,E) of vertices and a partition 
(^0;^i) of y denoting the positions of Player (drawn as circles) and Player 1 (drawn as squares). 
We require that every vertex has at least one outgoing edge. A set X C V induces the subarena G[X] = 
{V r\X,Vor\X,Vir\X,En{X xX)),if every vertex inX has at least one successor in X. A Muller game 
^ = (G, ^0,^1 ) consists of an arena G and a partition {^o,^i ) of 2^. 

A play in G starting in v € V is an infinite sequence p = poPiPi ■ ■ ■ such that po = v and (p„ , p„_|_i ) € E 
for all « G N. The occurrence set Occ(p) and infinity set Inf(p) of p are given by Occ(p) = {v \ 3n€ 
N such that p„ = v} and Inf(p) = {v G V | B'^n G N such that p„ = v}. We will also use the occurrence 
set of a finite play w. A play p in a Muller game is winning for Player / if Inf(p) G 

A strategy for Player / is a function a: V*Vi V satisfying {s,a{ws)) G E for all ws G V*Vi. The 
play p is consistent with a if p„+i = a(po • . • p„) for every « G N with p„ G V,. The set of strategies for 
Player / is denoted by IT;. A strategy is called finite-state, if it can be implemented by an automaton with 
output that reads finite plays and outputs the vertex to move to. We will say that a finite-state strategy is 
of size n, if there exists an automaton with n states that implements it. 

A strategy a for Player / is a winning strategy from a vertex v G V, if every play that starts in v and 
is consistent with a is won by Player /. The strategy a is a winning strategy for a set of vertices W <^V, 
if every play that starts in some v G W and is consistent with a is won by Player /. The winning region 
Wi of Player / contains all vertices, from which she has a winning strategy. A game is determined if Wq 
and W\ form a partition of V. 

Theorem 1 (|[Il|5l|71). Muller games are determined with finite-state strategies of size n-n\, where n 
denotes the size of the arena. 
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Let G = {V,Vo,Vi,E) be an arena and let X C y be a set that induces a subarena. The attractor for 
Player i of a set F C y in X is Attrf (F) = Ui=o^« "^^^^^ Aq = F nX and 

A„+i =A„U{v eViHX \ 3v' eA„ such that (v, v')eE} 

U {v G Vi-iHX \W ex with (v, v') eE:v' eA„} . 

A X C y is a trap for Player i, if all outgoing edges of the vertices in V/ nX lead to X and at least one 

successor of every vertex in V\-i nX is in X. 

Lemma 2. Let G = (y, yo, yi ,£") be an arena and F,X QV. 

1. For every v G Attrf (F) Player i has a positional strategy to bring the play into F. 

2. The set y \ Attrf (F) induces a subarena and is a trap for Player i in G. 

3 The Scoring Functions and Finite-time Muller Games 

This section introduces the notions that are required to formally define finite-time Muller games. In his 
study of these games, McNaughton introduced the concept of a score. For every set of vertices F we 
define the score of a finite play w to be the number of times that F has been visited entirely since w last 
visited a vertex in y \ F. 

Definition 3 (Score). For every F CV we define Scf : y+ — )■ N 

Scf(w) = max{A: G N | . . . G such that Occ(x,) = F for all i andx\ ■ ■ -Xk is a suffix ofw}. 

We extend this notion by introducing the concept of an accumulator. For every set F, the accumulator 
measures the progress that has been made towards the next score increase of F. 

Definition 4 (Accumulator). For every F QV we define Accf : y+ — 2^ by Kccf{w) = Occ(x), where 
X is the longest suffix ofw such that Scp(w) = Scp{wy^^) for every suffix y of x, and Occ(x) C F. 

Finally we define the maximum score function. This function maps a subset =^ C 2^ and a play p to 
the highest score that is reached during p for a set contained in J^. 

Definition 5 (MaxScore). For every ^ C 2^ we define MaxSc ^ : y + uy ® NU {oo} MaxSc ^ (p) = 
maxf g maxvycp Sc/r (w). 

McNaughton proposes that scores should be used to decide the winner in a finite-time Muller game. 
As soon as a threshold score of k for some set F is reached, the play is stopped and Player / is declared 
the wiimer, if F G The next lemma shows that this condition is sufficient to ensure that the game 
terminates after a finite number of steps. 

Lemma 6. Let G be an arena with vertex set V. Every w eV* with \w\ > fc'^' satisfies MaxSc2v (w) > k. 

Proof. We will show by induction over \V\ that every word w eV* with |w| > ^1^1 contains an infix x 
that can be decomposed as x = xi ■ • -x^ where every x; is a non-empty word with Occ(x,) = Occ(x). This 
will imply MaxSc2v (w) > k. 

The claim holds trivially for |y | = 1 by choosing x to be the prefix of w of length k and x,- = s for the 
single vertex s eV . For the induction step, consider a set V with n-|- 1 vertices. If w contains an infix x 
of length ^" which contains at most n distinct vertices, then we can apply the inductive hypothesis and 
obtain a decomposition of an infix of v with the desired properties. Otherwise, every infix x of w of length 
k^ contains every vertex of V at least once. Let x be the prefix of length of w and let x = xi • • • x^ be 
the decomposition of x such that each x, is of length fc". Then, we have Occ(x/) = Occ(x) = V for all i. 
Therefore, the decomposition has the desired properties. □ 
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Lemma |6] implies that a finite-time Muller game with threshold k must end after at most k^^^ steps. 
We can also show that this bound is tight. For every > we give an inductive definition of a word 
over the alphabet £„ = {1, . . . ,n} by H'(,t.i) = l'^^' and = (h'(j.„_i)?i)'^^'w(j.„_i). Clearly, the word 
has length k" — I, and it can also be shown that MaxSc2L„ (w) < k. 

Finally, to declare a unique winner in every finite-time Muller game we must exclude the case where 
there are two sets such that both sets hit score k at the same time. McNaughton observed that for A; > 2 
the first set to hit score k will be unique. Before we reprove this, we will first show a useful auxiliary 
result that will also be used later in the paper. 

Lemma 7 (cf. Theorem 4.2 of [8|). Let w G V^. The sets F with Scf (w) > 1 together with the sets 
Accp{w) for some F form a chain with respect to the subset relation. 

Proof. It suffices to show that all such sets are pairwise comparable: let F and F' be two sets such 
that either Scf (w) > 1 or f = Acc//(w) for some H QV and either Scfi{w) > 1 or F' = Acc//'(w) for 
some H' C V . Then, there exist two decompositions w = wqW\ and w = w'qw\ with Occ(wi) = F and 
Occ(wj) = F' . Now, either wi is a suffix of w\ or vice versa. In the first case, we have F C F' and in the 
second case F' QF. □ 

Note that Lemma |7] implies that there are at any time at most \V\ sets with non-zero scores. 

Lemma 8 (lEl). Let k,l>2, let F,F' C V, let weV* and v € V such that Scf{w) < k and Scfi{w) < I. 
IfScpiwy) = k and Scf'(wv) = /, then F = F'. 

Proof. Towards a contradiction assume F ^ F'. By Lemma |7] we can assume F' C F, i.e., there exists 
some q ^ F\F'. Then, Scf (wv) = k and Scf'(wv) = / imply the existence of decompositions wv = 
wqWi ■■■Wk and wv = w'qw[ ■■■w\ such that Occ(w,) = F and Occ(w-) = F' for all / > 1. As q ^ F', 
w\ • • - wj is a proper suffix of Wk. Furthermore, as Scp{w) < k, we have v ^ Occ{wkV^^). However, we 
have V £ F' and hence v € Occ(h'[_j), which is an infix of wtv^^. This yields the desired contradiction. 

□ 

We are now in a position to define a finite-time Muller game. Such a game = {G,^o,^i,k) 
consists of an arena G = {V,Vo,Vi,E), a. partition {^o,^i) of 2^, and a threshold k>2. By Lemma |6] 
we have that every infinite play must reach score k for some set F after a bounded number of steps. 
Therefore, we define a play for the finite-time Muller game to be a finite path w = wq • • • w„ with 
MaxSc2v(wo- • -Wn) = k, but MaxSc2v(wo • • • w„_i) < k. Due to Lemma [8j there is a unique F CV 
such that Sc f{w) = k. Player wins the play w if F £ and Player 1 wins otherwise. The definitions 
of strategies, plays, and winning sets can be redefined for the finite games. 

Zermelo [9J has shown that a game in which every play is finite is determined. Therefore, it imme- 
diately follows that finite Muller games are determined. 

Lemma 9. Finite-time Muller games are determined. 

In fact, McNaughton considered a slightly different definition of a finite-time Muller game. Rather 
than stopping the play when the score of a set reaches the global threshold k, his version stops the play 
when the score of a set F reaches |F| ! + 1. 

Theorem 10 (|8|). IfWi is the winning region of Player i in a Muller game {G,^o,,^i), and W/ is the 
winning region of Player i in McNaughton's finite-time Muller game, then Wi = Wj. 
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4 Zielonka's Algorithm For MuUer Games 

This section presents Zielonka's algorithm for Muller games ifTOl . a reinterpretation of an earlier algo- 
rithm due to McNaughton |7|. Our notation mostly follows lllH. We will use the internal structure 
of the winning regions as computed by the algorithm to define a strategy that bounds the scores of the 
losing player by 2. 

As we consider uncolored arenas, we have to deal with Muller games where (^o,^i) is a partition 
of 2^ for some finite set V' 5 V, as the algorithm makes recursive calls for such games. This does not 
change the semantics of Muller games, as we have Inf(p) C V for every infinite play p. 

We begin by introducing Zielonka trees, a representation of winning conditions (^Oi-^i)- Given a 
family of sets ^ C 2^' and X C V', we define ^ \ X = {F e ^ \ F C X}. Given a partition (^o, ^i) 
of 2^', we define (=^o,=^i) \ X = {^o \X,^i fX). Note that ^ \ XC^. 

Definition 11 (Zielonka tree). For every winning condition (^Oi^i) defined over a set V', its Zielonka 
tree is defined as follows: suppose that V' E and let VqjVj', . . . , V^'_j be the C-maximal sets 

in ^i-i. The tree consists of a root vertex labelled by V' with k children which are defined by 

For every Zielonka tree T, we define RtLbl(r) to be the label of the root in T, we define BrnchFctr(r) 
to be the number of children that the root has in T, and we define Chld(r,7) for < j < BmchFctr(r) to 
be the j-th child of the root in T. Here, we assume that the children of every vertex are ordered by some 
fixed linear order. 

The input of Zielonka's algorithm (see Algorithm [B is a finite arena G with vertex set V and the 
Zielonka tree of a partition {^o,^i) of 2^ for some finite set V' ^ V. The algorithm computes the 
winning regions of the players by successively removing parts of Player O's winning region (the sets 
Uo,U\,U2, ■ ■ ■)■ By doing this, the algorithm computes an internal structure of the winning regions that 
will be crucial to proving our results in the next section. 

For the rest of this paper we will refer to the sets of vertices and the subtrees of ^^g,^^ as computed 
by the algorithm. 

Figure |2] depicts the situation in the ?i-th iteration of the algorithm. The vertices in Un-i have already 
been removed and belong to Then, all vertices in the (1 — /)-attractor of Un-\ also belong to 
After removing these vertices from the arena, the algorithm also removes the vertices in the /-attractor of 
RtLbl(r„). The remaining vertices form a subarena whose vertex set is a subset of RtLbl(r„). Hence, the 
algorithm can recursively compute the winning regions W" in this subarena with Zielonka tree r„. By 
construction, the winning region W"_j is also a subset of This is repeated until the sets converge 
to Wi^i. All remaining vertices belong to W,-. 

Furthermore, we have the following properties that will be used in the next section. Let n denote 
the index at which Zielonka's algorithm terminated. The sets Wy_j for j < n are obviously disjoint. 
However, the sets W" ^ for j in the range n — k < j < n might overlap. Player / can confine a play in 
W" ^ until Player 1 — / decides to leave this set. However, his only choice is to move to a vertex in 
Attr,- ""^ (y \ RtLbl(r„_y)), as he can neither move to a vertex in A„ = A„_y {X„ = X„^j is a trap for him) 
nor to a vertex in = 0. This implies that Player / can force the play to visit V \ RtLbl(r„_y), if 
Player 1 — / decides to leave W" ^ . 

Theorem 12 ([10|). Algorithm\l}terminates with a partition (Wo,Wi), where Player has a winning 
strategy for Wq and Player 1 has a winning strategy for Wi. 

Zielonka's winning strategies are defined inductively: Player 1 — / plays an attractor strategy to ?7„_i 
on A„ \Un-i and on each W"_i according to the winning strategy computed recursively. A play consistent 
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Algorithm 1 Zielonka(G, iF^g^^,). 

/ := The index j such that RtLbI(.2^Q ) G 

k := BrnchFctr(iF^Q^^j) 

if The root of i^^^o,^, has no children then 

return(Wb,lVi) 
end if 

:=0;«:=O 
repeat 

n:=n+\ 

An:=Attx\_.{Un-,) 
X„:=V\An 

T„ := Chld(^^Q , n mod k) 
Yn:=X„\Attvf\v\mUA{Tn)) 
(Wo«,Wi") :=Zielonka(G[F„],r„) 

until Un = Un- !=••• = Un-k 

return {Wo,Wi) 



with this strategy will from some point onwards be consistent with one of the winning strategies for some 
hence it is winning for Player 1 — /. 

Player / plays using a cyclic counter c: suppose c = j. In W" \ she plays according to the winning 
strategy computed recursively. If Player 1 — / chooses to leave W"^, then she starts playing an attractor 
strategy to reach V \RtLbl(r„_y). Once she has reached this set she increments c modulo k and begins 
again. There are two possibilities for a play consistent with this strategy: if it stays from some point 
onwards in some W" \ then it is winning by the inductive hypothesis. Otherwise, it will visit infinitely 
many vertices in V \RtLbl(Chld(.2^Q^j ,7')) for every j in the range < j < BrnchFctr(^^g ), which 
implies that the infinity set of the play is not a subset of any RtLbl(Chld(.^^o.^^, , j')). Hence, it is in 
and the play is indeed winning for Player /. 

We conclude this section by showing that the winning strategies for MuUer games as defined in |[TOl 
do not bound the score of the opponent by a constant. 









Attrf"(y\RtLbl(r„)) 


Un-l \ 






y\RtLbi(r„) 



Figure 2: The sets computed by Zielonka's algorithm. 
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Lemma 13. There exists a family of Muller games = (G„ , > =^1" ) w/f/i | G„ | = « + 1 and \ | = 1 
such that MaxSc,^" (Play (v, CJ, t)) =n where o is Zielonka 's strategy, v €zV, and T € Hi. 




Figure 3: The arena G„ for Lemma [T3l 



Proof Let G„ = (y„,y„,0,£„) with y„ = {0, . . . ,«}, E„ = {(/ + 1,/) | / < U {(0,«), (1,«)} (see Fig- 
ure (3]), and = {V„}. The Zielonka tree for the winning condition (^q ,^") has a root labeled by 
and n + l children that are leaves and are labeled by V,, \ {/} for every i ^Vn- Assume, the children are 
ordered as follows: \ {0} < • • • < V,, \ {n}. Zielonka's strategy for which depends on the ordering 
of the children, can be described as follows. Initialize a counter c : = and repeat the following: 

1. Use an attractor strategy to move to vertex c. 

2. Increment c modulo n + l. 

3. GotoE 

Now assume a play consistent with this strategy has just visited 0. Then, it visits all vertices 1, . . . ,« in 
this order by cycling through the loop 1 « times. Hence, the score for the set { 1 ,...,«} is infinitely 
often n. □ 

By contrast, Player has a positional winning strategy for that bounds the opponents scores by 2. 
The reason the strategy described above allows a high score for Player 1 is that it ignores the fact that, 
while it attracts the play to the vertex 0, it visits all other vertices. In the next section we will construct a 
strategy that recognizes such visits. Thereby, the strategy is able to bound the opponent's scores by 2. 

5 Bounding the Scores in a Muller Game 

In this section, we prove our main result: the finite-time Muller game with threshold 3 is equivalent to a 
Muller game. 

Theorem 14. IfWi is the winning region of Player i in a Muller game (G, ^0,^1 ), and W! is the winning 
region of Player i in the finite-time Muller game (G, ^1 ; 3), then Wi = Wf 

To prove Theorem[T4]we use the following approach. If MaxSc^(_, (p) < 2 for an infinite play p, then 
there exists a prefix w of p that is winning for Player / in the finite-time Muller game with threshold 3. 
Hence, if a winning strategy for Player / in the Muller game bounds the scores of her opponent by 2, then 
this strategy is also winning for the finite-time Muller game with threshold 3. We will show that such a 
winning strategy exists. Theorem [14] then follows by determinacy of Muller games. Therefore, the rest 
of this section will be dedicated to proving the following lemma. 

Lemma 15. Player i has a winning strategy o for her winning region Wi in a Muller game ^ = 
(G, J^o,J^i ) such that MaxSc (Play (v, C7, t)) < 2 for every vertex v € Wi and every T G ITi-;. 
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We will use the internal structure of the winning regions as computed by Zielonka's algorithm to 
give an inductive proof of this claim. Traditionally, Zielonka's strategies forget the history of the play 
every time they switch between an attractor strategy and a recursively computed winning strategy. For 
example, suppose that a play w spends some time in ^ before Player decides to move out of the set 
W"^^. Player 1 responds to this by playing the attractor strategy to the set V \RtLbl(r„_y) in order to 
reach some vertex v G V \RtLbl(r„_j). If v G W" then Player 1 will play the winning strategy for 
the set W" ■'^^ starting at the vertex v. 

Note that the play w may have spent a significant number of steps in W" (while playing according 
to the attractor strategy) before Player 1 begins to play the winning strategy for that set. Yet in Zielonka's 
strategy. Player 1 will behave as if the first vertex visited in W" -'^^ is v. In other words, the suffix of w 
that is contained in is effectively forgotten by the strategy. 

This fact is irrelevant if we are only concerned with constructing a winning strategy, but when we 
want to construct strategies that guarantee certain scores are bounded by 2, the entire suffix of w must be 
retained in this kind of situation. This motivates the following definition of a play. A play begins with a 
finite prefix over which the players have no control, and then continues as a normal play would. The key 
difference is that the strategies may base their decisions on the properties of the prefix. 

Definition 16 (Play). For a non-empty finite path w = wq - ■ - Wn and strategies o G IT,-, T G ITi-,, we 
define the infinite play Play(w, a, t) = P0P1P2 ' ■ ■ inductively by pj = wjfor < j <n and for j > n by 

\T{p(ypj-i) ifpj-ieVi^i 

In fact, the finite paths that are passed to our strategies will not be totally arbitrary. As described 
previously, these paths arise out of decisions made before the strategy was recursively applied. Therefore, 
we have some conti^ol over the form that these paths take. We will construct our strategy so that every 
path passed to a recursive strategy has the following property. 

Definition 17 (Burden). Let ,9' C 2^'. A finite path w is an ^-burden //'MaxSc^(H') < 2 and for every 
F either Sc/r(w) = or Scf (w) = 1 and AcCf (w) = 0. 

We are now ready to prove by induction over the height of the Zielonka tree that both players have 
a strategy to bound their opponent's scores by 2 on their winning regions, even if the play starts with a 
burden. We begin by considering the base case, which is when the Zielonka tree is a leaf. For the rest of 
this section we will assume RtLbl(i2^Q^|) G Otherwise, swap the roles of Player and 1 below. 

Lemma 18. Let (G,^Oi^i) be a Muller game with vertex set V such that 2f,^g^^^ is a leaf. Then, 
Player 1 has a strategy T such that MaxSc_^Q(Play(H'v, a,T)) < 2 for every strategy a G ITq and every 
^Q-burden wv with v G V. 

Proof. As iF^Q is a leaf and RtLbl(iF^Q ) G by assumption, we have ^0 = 0- Hence, any 
strategy t for Player 1 guarantees MaxSc^Q (Play(H', a, t)) < 2. □ 

We now move on to the inductive step of the proof. We will give two versions of the inductive step, 
one case will be for the set Wq and the other will be for the set W\ . We will consider the case for the set 
Wb first. 

The situation in this case is shown in Figure IH Our strategy for this case will be the same as 
Zielonka's strategy, but it must also deal with the finite path that has been passed to it. We denote 
the attractor strategy for Player on A„ \ U„-i by a,f and we denote the recursively computed strategy 
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A2\Ui 




A3\U2 




Figure 4: The structure of Wq. The dashed line indicates a play. 



for Player on G[Wo ] as of. We can assume that of satisfies the inductive hypothesis, which means that 
MaxSc ^^ \ws (Play (wv, of, t)) < 2 for every strategy T for Player 1 in G[Wo'] and every \ Wq -burden 
wv with V G Wq. We define the following strategy o* for Wq: 



Our strategy chooses to use of or o^ precisely when Zielonka's strategy chooses to do so. The 
difference is that our strategy is careful to pass the appropriate finite path to the recursively computed 
strategy of. 

The sets Uj form a sequence of nested traps for Player 1. Therefore, if Player 1 chooses to leave 
some Uj \ Uj-i and Player plays according to o*, the play can never return to Uj \ Uj-i. This implies 
that a play that has left some Wq will never return. Also, every vertex in Aj \ Uj-i can be seen at most 
once, as o* behaves like an attractor strategy on these vertices. The next lemma will be used to deal with 
cases that arise from these observations. 

Lemma 19. Let w be an {F}-burden, let v,v' € F. 

1. Let p be an infinite play in which v appears at most once. Then, MaxScj/7|(H'p) < 2. 

2. Let p be an infinite play such that v is never visited after v' was visited for the first time. Then, 
MaxSc{p}(M'p) < 2. 

Proof. For both statements, it suffices to show that Scf {wx) < 2 for every prefix x of p . Let w = wq • • • w„. 
We consider the two cases given by the definition of a burden: 

• Scf(w) = 1. As Accir(w) = 0, we have Scir(wo---w„_i) = 0. Hence, the suffix Wk---w„ of w 
witnessing Sc^ (w) = 1 is minimal. 

1. As w^; • • • w,, is minimal and as v occurs at most once in p, we conclude that the score for F 
increases at most once after the prefix w. 

2. As the suffix is minimal, the score of F can increase to 2 only by or after visiting v' for the 
first time. But v is then never visited again. Hence, the score for F is bounded by 2. 

• Scf (w) = 0. Let y be the shortest prefix of p such that Sc^ (wy) = 1 . If such a prefix does not exist, 
then we are done. 

1. Otherwise, y V does contain v at most once. Hence, the score for F increases at most once 
after the prefix wy. 

2. Again, if such a prefix exists, then the score for F can reach 2 only by or after visiting v' for 
the first time after wy. But v is then never visited again. Hence, the score for F is bounded 




of{w'v) if V G Wq and w' is the longest suffix of w with Occ(w') C Wq" 
(7„A(v) ifvGA„\f/„_i 



by 2. 



□ 
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We are now able to prove the inductive step for Player 0, by applying the the observations formalized 
in Lemma [19] to the structure of Wq. 

Lemma 20. We have Ma^!^Sc^^^^/g(Play{wv,a* ,t)) < 2 for every strategy T G LIi and every ^\ \ Wq- 
hurden wv with v € Wq. 

Proof. Let p = poPiPi ■■■ = w^^ Play(wv, a*, t). Note that po = v, which is the first vertex where the 
players get to choose a successor. Assume p enters some A„ \ U„-i. Then, it will afterwards enter Un-i 
while seeing every vertex in A„ \ U„-i at most once, as A,, is an attractor and a* behaves like an attractor 
strategy on A„ \ Un~i. Now assume p enters some Wq". Then, it will stay in WJ" until Player 1 decides to 
leave. However, his only choices are vertices in A„_i, as Wq is a trap for him in V \A„_i. Hence, once a 
set A„ \ Un- 1 or Wq is left, it will never be entered again. 

As wpo is an \ Wi -burden, it suffices to show Sc^ (wpo ■ • • p„) < 2 for every n > and every 
F £ \Wi. We will consider several cases for F: remember that either Scf(wpo) = or Sc^ (wpo) = 1 
and Accf(wPo) = 0. 

• F n (U„>i (^n \ Un-i)) / 0: Every vertex in U«>i (^n \ Un-i) occurs at most once in p. Hence, 
Scf (wpo ■■■ Pn) < 2 for every « > by Lemma [T9TT] 

• F Q U„>i Wq" with F n W;j / and F n W^{ / for / < j: p cannot visit Wj after it has visited Wi. 
Thus, Scf{wpQ ■ • • Pn) < 2 for every « > by Lemma [19121 

• F C Wq for some j: If p never visits Wq^, then Scf (wpo • • • p„) = for every n >0. So, assume p 
enters Wq at position p,„ for some m > 0. 

Suppose m = 0: wpo is also an |" Wq"' -burden and wpoPiPi • • • is played according to of until 
Player 1 decides to leave Wq^ at some position p > m. Applying the inductive hypothesis yields 
that af guarantees Scf (wpo • • - p,,) < 2 for every n in the range m <n < p. Should the play leave 

Wq^, then Scf is reset to and stays 0, as Wq^ cannot be visited again. If Player 1 never leaves Wq'', 
then the scores are bounded by 2 throughout the whole play. 

If m > 0, then Scf (wpo • • - p^) = for every n < m. Also, the play PmPm+iPm+2 • • • in Wq^ starts 
with the \ WQ^-burden wpo • • • p,„, (as p„,-i ^ Wq^) and the inductive hypothesis on of guarantees 
Scf (wpo ■■■ Pn) < 2 until Wq"' is left, from which point onwards Scf is always 0. □ 

We now turn our attention to the strategy for Player 1 . For the rest of this section n will be the index at 
which Zielonka's algorithm terminated, and k = BrnchFctr(.^^(,^j ). The situation for Player 1 consists 
of k overlapping instances, one for each child, of the situation depicted in Figure [5] 



Attrf"-^(y\RtLbl(r„_y)) 






-> 

(y\RtLbi(r„_y))nx„_y 


i i wr^'----T''' 

V . i >i, 1 
^1 i 




^ 1 x 





Figure 5: The structure of W\ with respect to Tn-j. The dashed line indicates a part of a play between 
two change points. 
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For the sake of convenience we define Zy = Xj \ RtLbl(r, ) for every j in tlie range n — k<j<n. For 
every j in the range n — k < j < n, we liave an attractor strategy for Player 1 on Attr^ ^ (Zy) \Zy which we 
call Ty^, and we have a recursively computed winning strategy rj for Player 1 on Once again, we 

can assume the inductive hypothesis holds for the strategy zj, which means that MaxSc^^ |^^^_, (Play (wv, a, 

2 for every strategy a of Player in and every \ W/ -burden wv with v G Wy. 

Our strategy improves the strategy given by Zielonka in the sense that it uses a different method for 
choosing a new child of the root. Zielonka's strategy works through the children in a cyclic order, which 
means that when the play enters the set V \RtLbl(ry) the strategy will then move on to the child Ty+i, 
and begin playing either rj^j or ^j^i- By contrast, we will use a more careful method for picking the 
next child of the root that will be considered. 

Our method for picking the next child will make its decision based on which sets of the opponent 
have either non-zero score or a non-empty accumulator. For this purpose, we define the indicator function 
of a play Ind : 2^ as 

Ind(w) = IJ FU [j Accf{w) . 

Scf(>v)>0 Acc,7(h.')^0 

Recall that Lemma |7] implies that the sets we are considering form a chain in the subset relation. 
This implies that the indicator function always gives some subset of a set that belongs to the opponent. 
Therefore, we can argue that there must always exist a child of the root whose label contains the indicator 
set. 

Lemma 21. For every w, there is some j in the range n — k < j <n such that Ind(w) C RtLbl(ry). 

Proof. Lemma |7] implies that there is a maximal set G such that Ind(w) = G, with either Scg(w) > or 
Accf (w) = G for some F € with G F. Hence, Ind(w) C F for some F € J^q, and, by definition of 
^,^0.^1 > there is some child of the root labeled by RtLbl(ry) such that F C RtLbl(ry). □ 

When a new child must be chosen, our strategy will choose some child whose label contains the value 
of the indicator function for the play up to that point. It is also critically important that this condition is 
used when picking the child in the first step. This is the part of the strategy where the finite initial path 
can have an effect on the decisions that the strategy makes. 

We can now formally define this strategy. We begin by defining an auxiliary function that specifies 
which child the strategy is currently considering. We define c : Wy {n — k+ l,...,n,±} as c{e) = _L 
and 



c{wv) 



c{w) ifveRtLbi(r,(,,)) 

j if V ^ RtLbl(r,.(^,,)), Ind(wv) / and j minimal with Ind(wv) C RtLbl(ry) 

j if V ^ RtLbl(r,.(^,,) ), Ind(wv) = and ; minimal with v G RtLbl(ry) 

^ ifv0a-.<y<„RtLbl(ry) 



Now we can define T* for W\ as 

t(wv)* 



tJ-{wv) if c{wv) = j,v G and w' is the longest suffix of w with Occ(h'') C 
Tj^(v) if c(wv) =j,ve RtLbl(ry) \ w/ 
X if c(wv) = _L where x ^Wi with (v,x) G E 
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We will now prove that this strategy has the required properties. Our proof will use the concept of 
a change point, which is a position in a play where the c function changes. More formally, suppose that 
p = P0P1P2 • • • = Play(wv, a, r*) for some ^o-burden wv with v G Wi and a G ITo. Note that po = v, 
which is the first vertex at which the players get to choose the successor. We say that a position r of p is 
a change point, if r = or if c{wpo ■ ■ ■ p,— 1) 7^ c(wpo ■•■pr)- 

Let a; be a finite prefix of an infinite play that is consistent with T such that the last position in ;t; is a 
change point. Moreover, assume that x satisfies the burden property. Our strategy will pick some index j 
such that Ind(x) C RtLbl(r, ). The play will then remain in the set until Player chooses to leave the 
set W(, at which point the strategy attracts to the set V \RtLbl(ry). Once such a vertex is reached, the 
scores for all sets F € ^0 with Scp{x) > are reset to and the accumulator for F is empty for every 
F e ^0 with Accf (x) / 0. While attracting the play to V \RtLbl(ry) the scores for other sets F E #0 
might rise and the accumulators fill up. However, as every vertex in the attractor is seen at most one, we 
are able to show the following: if the play up to a change point is a ^o-burden, then the play up to the 
next change point is also a burden. As a ^0 -burden bounds the scores of Player be 2, this suffices to 
prove that T* bounds Player O's scores by 2. 

Lemma 22. Let p be as above and let r < s be two change-points such that there exists no change point 
t with r <t < s. IfwpQ ■•■ Pr is an \ W\-burden, then so is wpo ■ ■ ■ Ps- 

Proof. From the definition of a change point we get c(po •■•Pt) = c(po • • • Pr) for every t in the range 
r <t <s. 

If c{wpo ■■■Pr) = ±, then pt \J„_ -<:<;■<« ^^'^Lbl(ry), which implies Pt ^ F for every F G ^q. Hence, 
we have Sc^ (wpo • • • p?) = for every r <t < s and every F G ^q. Furthermore, we have either 
Sc/r(wpo • • • Ps) = 0, if F 7^ {ps} and Scp{wpo ■ ■ ■ Ps) = I and Accp{wpo • • • p.^) = otherwise. 

Now, assume c{wpo ■ • • Pr) = j for some j in the range n — k < j < n. Then, there exists an u in the 
range r<u<s such that p^ • • • p„-i is in W^, Pu - ■■ Ps-i is in Attr^ ' {Zj) \Zj, and we have p, ^ RtLbl(r,). 
Note that both parts could be empty. The situation is depicted in Figure |6](cf. also Figure[5]l. 



Attr/' (Z,)\Z, ^RtLbl(rj) 



Figure 6: A part of a play between two change points. 

Furthermore, at positions / in the range r < i < u — I, Player 1 plays according to and positions / 

in the range u < i < s — l,he plays according to tj^. This implies that every vertex in Attif^ (Zj) \ Zj is 
seen at most once in between p^ and Ps, i.e., in the infix p„ • • • P.s-i- 

Finally, let Ind(wpo ■■■pr) = G. If G 7^ 0, then G C RtLbl(r,); otherwise, p^ G RtLbl(ry), both by 
definition of c. 

It suffices to show for every F G \ Wi : 

1. If Scf{wpo • ■ -pr) = 1 and Accf (wpo • • • p,-) = 0, then Sc^ (wpo • ■ -p,) < 2 for all r < f < 5 and 
Scf (wpo---p,) =0. 

2. If Scf{wpQ ■■■pr) = 0, then Scf (wpo • • - p,) < 2 for all r < f < and either Scf{wp(j ■ ■ ■ p^) = 1 and 
Accf (wpo • • • p,) = or Sep (wpo • • • p,) = 0. 

dJ As / F C Ind(wpo • • • Pr), we have F C RtLbl(r,) and hence Scf (wpo • • • p.v) = 0, as p^ G Zj = 
\RtLbl(r,). It remains to show Scf{wpo • • -p^) < 2 for all r < ? < s. We consider several cases for F: 
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• F nZy 7^ 0: as the vertices in Zj are not visited by • • • Ps-i, the score of F cannot increase in this 
interval. 

• F nAttr I (Zj) \ Zj 7^ 0: every vertex in the attractor is seen at most once. Hence, we obtain 
Scf (wpo • • ■ Pf) < 2 for all r < ? < 5' by Lemma [T9ni 

• F C W^: If Pr € Attr^^(Zy) \ Zj, then Scf is reset to at p^ and stays until p.,, hence, we have 
Scf (wpo • • • Pf ) < 2 for all r < ? < 5. 

So, suppose Pr G W/. As wpo ••• Pr is also an \ W/- burden and as F G t W^/> the inductive 
hypothesis on t/ guarantees Scf (wpo ■■■pt) < 2 for every r < t < u. Ks, p, ^ for every f in the 
range u <t < s,\Ne. also have Scf (wpo ■ • -p?) = for these positions. 

121 Let G = Accf (wpo ■■■Pr) ^ Ind(wpo • • • Pr)- Note that G C RtLbl(rj), but it could be the case that 
F 2 RtLbl(ry). Again, we consider several cases for F: 

• If Ps G F, then p., ^ Acc^ (wpo • • • p^), as Accf (wpo • • • p, ) ^ RtLbl(ry) and p^ ^ RtLbl(r,). Hence, 
Scf stays at every position between r and (excluding) s, as the vertex p, is never visited. If 
Scf (wpo • • • Pi) = 1, then Acc/r(H'Po • • • Ps) = 0; otherwise Scf is at position s, too. 

• If Ps ^ F , then Scf (wpo • • • Ps) = 0. To bound the score between the positions r and s by 2, we 
have to consider three subcases: either F r\Zj ^ 0, F n Attrj^(Zy) \ Zj ^ d) or F Q W^. All cases 
can be solved by analogous reasoning to these cases in[T] □ 

Now, to prove the inductive step for Wi , we simply need to observe that the finite path ending at the 
first change point is a burden by assumption. 

Lemma 23. We have MaxSc.^^ ^w, (Play (wv, (J,T*)) < 2 for every strategy o G Hq and every \ W\ - 
burden wv with v G VVi. 

Proof. Let p = w^^ Play(H'v, a, T*). If p contains infinitely many change points, then Lemma[22l implies 
MaxSc,^Q(Play(H'v, a, T*)) < 2 as the play starts with a burden, i.e., there scores are bounded by 2 in wv, 
and in between any two change points, the scores are bounded by 2 as well. If p contains only finitely 
many change points, then Lemma |22] implies that the scores of Player up to the last change point are 
bounded by 2. From that point onwards, Play(wv, a, T*) is consistent with some t(, and the play up to 
that point is a \ W/ -burden, as it is an \ Wi -burden due to Lemma|22] Hence, the scores for every 
set F G \ are bounded by 2 from that point onwards, by the inductive hypothesis on t/. The scores 
of every F £ \Wi with F % W/ are bounded by 1, as vertices not in W/ are no longer visited. □ 

Finally, we can prove Lemma [15] which also completes the proof of Theorem [T4l 

Proof. Theorem [T2l yields that algorithm [T] is correct, i.e. the sets W, returned are indeed the winning 
regions of the players. We prove the following stronger statement by induction over the height of if^^o,^i : 
let y be the vertex set of G. Player i has a winning strategy o for her winning region Wi in G such that 
MaxSc ^^(wv, a, t) < 2 for every strategy T G Hi-; and every ^i-,- I" V -burden wv in G. This implies 
Lemma[T5] as the finite play v for every v G W^- is an ^i-, \ V-burden. 

For the induction start, apply Lemma [18] In the induction step, use the strategies obtained from the 
induction hypothesis to define o* and T* as above and apply Lemma [20l respectively Lemma [23] Both 
strategies are winning, as they bound the scores of the opponent by 2. □ 
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6 Conclusion 

We have presented a criterion to stop plays in a MuUer game after a finite amount of time that preserves 
winning regions. Our bound S'^' on the length of a play improves the bound |G|! + 1 obtained by a 
reduction to parity games. Furthermore, our techniques show that the winning player can bound the 
scores of the opponent by 2 and that this bound 2 is tight. 

However, it remains open whether a play can also be stopped after a score of 2 is reached. As the 
winning player cannot always avoid a score of 2 for the opponent, one has to show that the winning 
player always reaches a score of 2 for one of her sets before the opponent reaches score 2 for one of 
his sets. Our approach does not seem to be suitable for this, as the notion of a burden is not sufficient 
for this goal. Furthermore, it is unclear how to strengthen the definition while still retaining Lemmata 
corresponding to Lemma [20l and Lemma [23] 

A finite-time MuUer game with threshold ^ is a reachability game (in the unraveling of the original 
arena up to depth at most k^^^), which can be solved with simple algorithms. Another interesting direc- 
tion for research is to find a construction which turns a winning strategy for a finite-time Muller game 
with threshold 3 (or 2, if it is equivalent) into a finite-state strategy for the original Muller game. It is 
conceivable that such a construction would yield memory structures that are optimized for a given arena, 
something which does not hold for the LAR respectively Zielonka tree structures. 
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